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DATA PROCESSOR FOR IMPLEMENTING FORECASTING ALGORITHMS 

BY: 

Andrew W. Lo, Harry Mamaysky, And Jiang Wang 

FIELD OF THE INVENTION 

The present invention is generally directed to a data processing system and method 
for developing forecasts relating to future financial asset valuations. More specifically, the 
present invention is directed to a computer system for analyzing historical data relating to 

5 financial asset valuations and applying pattern recognition algorithms to develop forecasts of 
future asset valuations. 
STATEMENT OF RELATED CASE 

The present patent application relies and is based upon Provisional Application No. 
60/195,540, filed on April 7, 2000, entitled FOUNDATIONS OF TECHNICAL 

1 0 ANALYSIS: COMPUTATIONAL ALGORITHMS, STATISTICAL INFERENCE, AND 

EMPIRICAL IMPLEMENTATION, by Andrew W. Lo, Harry Mamaysky, and Jiang Wang, 
the disclosure of which is incorporated herein by reference. 

BACKGROUND OF THE INVENTION 
Perhaps the most famous disclaimer on Wall Street is that "past performance is no 

1 5 guarantee of commensurate future returns." While popular with the lawyers advising the 

promoters of mutual funds and the like, the above disclaimer is in fact contrary to an entire 
body of investment advisory techniques - known as technical analysis. Technical analysts 
collect large amounts of price data for select securities and graphically display the price v. 
time relationships for select periods. The resulting charts are then examined with the learned 

20 eye of the technical analyst with the hope of recognizing one or more familiar patterns in the 
recent price data on the chart. Through experience, patterns of price movements have been 
associated with future price increases (bullish sentiments) or alternatively, price drops 
(bearish sentiments). 



Technical analysis, on one level, is largely an exercise in human interpretation of 
complex graphics, as the data relating to price movements - say stock in a corporation - is 
charted in Cartesian coordinates for review. Applying this technique at its most basic level 
eschews traditional tools of investment analysts such as price-earning ratios, projected 

5 revenue growth, and the like. Indeed, the technical analyst - at this basic level - wholly 
ignores the attributes underlying the asset, focusing instead on the raw price data as a 
meaningful projector of future price movements. 

While many professionals on Wall Street feel uncomfortable about the usefulness of 
visual patterns in historical price movements as an important projector of short term future 

1 0 price movements, studies reflect its predictive capabilities. Indeed, it is a tool with 

significant potential for hedging, day-trading, momentum trading, managing mutual funds, 
risk management, and the like. Coupled with other analytical tools, and traditional 
investigatory techniques, sophisticated investors are greatly assisted in discerning the type of 
investment selection to be made - either for themselves or as advisors for others - such as 

1 5 large institutions, pension funds, or mutual fund companies. 

Pattern recognition is an important process in a number of fields. For example, 
forensic evidence relies on pattern recognition for correlating fingerprint samples, 
handwriting analysis, and face recognition. Sophisticated software has been developed to 
assist in handwriting recognition and optical character recognition ("OCR") apply computers 

20 to detect and assess patterns for translation into known logical syllogisms. 

Returning to Wall Street, recent trends include the application of large computer 
systems for intensive fundamental analysis in support of select investment strategies. While 
technical analysis, as discussed above, relies on human recognition of graphical patterns 
(shapes) in price - time charts, fundamental analysis applies established mathematical 

25 techniques - both algebraic and numerical. More grounded in accepted financial engineering 

principles, fundamental analysis was quickly adopted and branded as scientific, to contrast it 
from the more ethereal "technical analysts." This has resulted in a lesser role for technical 
analysis in investment forecasting and security selection. 
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It has been, however, established that forms of technical analysis do, in fact, provide 
meaningful predictions of future price trends. Within the thicket of subjective analysis and 
buried by the specialized jargon, a potentially powerful investment tool remained essentially 
unrealized. See, specifically, "Foundations of Technical Analysis: Computational 
5 Algorithms, Statistical Inference, and Empirical Implementation" by Andrew W. Lo, Harry 

Mamaysky, and Jiang Wang, published in The Journal of Finance, Vol. LV, No. 4, August 
2000, pp. 1705-1765 - the contents of which are incorporated by reference as if restated in 
full. It was with this understanding of the problems with the prior art that provided the 
impetus for the present invention. 
10 OBJECTS AND SUMMARY OF THE PRES ENT INVENTION 

It is thus an object of the present invention to provide an investment advisory system 
that applies technical analysis in an objectively analytical framework to develop projections 
on future pricing events of select securities. 

It is another object of the present invention to provide an investment management 
1 5 system for assessing current and past pricing trends on securities and, based thereon, 

formulate investment strategies. 

It is still another object of the present invention to provide an investment advisory 
method wherein past price data is plotted and assessed by software to detect patterns that are 
predictive of future price movements. 
20 It is yet another object of the present invention to provide a computer data processing 

system for collecting and organizing historical price data for select financial assets and 
processing the historical data to detect patterns predictive of future price movements for the 
select financial assets. 

It is a further object of the present invention to provide a data processing system that 
25 uses historical price and volume data to generate a non-linear relationship using a smoothing 
estimator to develop a pattern revealing numerical assessment for projecting future price 
movements. 
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It is still another object of the present invention to provide a feedback mechanism to 
adjust the smoothing estimator parameters based on the performance of past projection 
efforts. 

It is another object of the present invention to provide a data processing system to 
support distribution of price movement projections and recommendations to market traders. 

Some, if not all, of the above objects of the present invention are realized in a 
specific illustrative computer system implementation utilizing a smoothing estimator 
equation to provide a non-linear representation of a two-dimensional graphical pattern of 
price. The computer system first creates the non-linear relationship based on current and/or 
past pricing data for select periods and on specified intervals. The smoothing estimator 
reduces observational errors by averaging the data in sophisticated ways. The non-linear 
relationship is then tested for patterns known to be associated with a future price movement, 
with detection of such a pattern triggering a Results ("RSTOU)") output. This RST output is 
tested and then used to support predictions and recommendations based on expected future 
price movements, in the form of an numerical index - positive for bullish sentiments and 
negative for bearish sentiments, with the magnitude reflecting a confidence level. Variations 
of the index include the use of secondary factors, such as trading volume. 

Additional features of the present invention include the ability to test and modulate 
the smoothing estimator. Specifically, the system records the projections, RST, and then at 
some point in the future compares these to actual price movements. Failure to fit within a 
selected pricing criteria provokes a parameter modulation, wherein the co-efficients of the 
smoothing estimator are adjusted to enhance predictive accuracy. 

While the primary objective of the present invention is to recognize patterns that are 
now well known as predictors of future movement, such as "head and shoulders" and the 
"double bottom," a further object is to associate past price patterns - as yet unrecognized - 
with price movements, so as to ascertain patterns that have predictive value, but are 
otherwise visually hidden from human recognition. The present invention is applicable for 
predictions for the prices of many different types of securities, such as equities, debt 
instruments, futures, options, indexes, and other derivative instruments 
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The present invention is better understood by review of a specific illustrative 
embodiment thereof provided hereinbelow in a detailed description, including the following 
figures, of which: 

BRIEF DESCRIPTION OF THE FIGURES 

5 Figure 1 is a functional block diagram of the specific system components for use in 

practicing the present invention; 

Figure 2 is a logic defining block diagram describing system operation in support of 
the present invention; and 

Figure 3 is a logic defining block diagram further describing system operation in 
1 0 support of another aspect of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
First briefly in overview, the present invention is directed to a computer system 
programmed to process large amounts of data relating to asset transactions and/or pricing. 
The system includes either a dedicated database or a communication port linking to a remote 
1 5 data source for access to the requisite data. At select intervals, either event driven, periodic, 

or by request, data is pulled from the database and placed in CPU associated memory in 
preparation for processing. A smoothing estimator equation is selected and the data 
processed, creating the smoothly shaped, price curve. See the exemplary kernel regression 
described in the co-pending provisional patent application referenced above. This curve is 
20 then numerically (and globally) tested against stored predictive patterns ("PP"). A positive 
response triggers a result file, with a future price movement prediction. The Result is placed 
into a report ("report generator") for review and/or distribution on a selected basis, e.g., 

subscription and the like. 

The system includes various adjustable parameters for customized application. For 
25 example, data periods can be adjusted to cover short time frames - ranging from minutes to 

hours. Alternatively, the relevant time period can stretch to weeks or even months. In one 
embodiment, discussed below, price data is taken from "end of trade day" in New York. Of 
course, the time period selected will influence the predictive power of the associated and 
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detected shapes. In fact, the program parameters, such as h (bandwidth) will differ for the 
selected application of the smoothing algorithm. 

Over time, the system begins to detect and associate patterns that are otherwise 
difficult or impossible to visually discern. This learning and feedback function provides 
5 pattern associations not humanly readable, but otherwise predictive of future price 

movements. These are stored in the database with the visually detectable patterns and 
applied to historical price trends consistent with the process used for the original set of 
patterns. 

Data inputs to the system include data feeds on price, volume, and/or trade size and 

1 0 are received in digital form from data vendors, such as Reuters, Bloomberg, and Standard & 
Poor. System hardware will depend on the scale of operation and includes Sun 
Microsystems Enterprise Class Servers, and Terabyte systems disk arrays, applying Oracle® 
software controlled database. The System Results Output can be used internally to support 
trading or distributed on a fee-based subscription basis. 

1 5 With the foregoing overview in mind, attention is first directed to Figure 1 which 

provides a functional block diagram of the subsystems supporting the invention. A central 
(local) database is provided at block 10, storing price data and selected programming 
including numerical implementations of the requisite smoothing estimators. This local 
database is updated with current price data from subscription service, block 20, via 

20 asynchronous feed. At block 30, the CPU processes the price data and develops the shaped 
curves from the smoothing estimator. Once generated, the system tests the curve using 
numerical analysis against the stored library of predictive patterns, block 40. The results are 
processed at block 50 - report generator - and then passed to distribution, block 60. The 
results are further passed to the algorithm confirmation processor, block 70, stored and then 

25 later processed by reference to future (then current) pricing to discern predictive accuracy in 

the forecasting. Sequential operations is controlled by Counters I and J. 

The high level logic governing the processing of the system of Figure 1 is depicted in 
the flow chart of Figure 2. Logic conceptually begins at block 100, and entry of the User's 
password at block 1 10, authorizing access to the main menu for system configuration and 
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operation. In practice, the main menu includes many choices for the User to configure the 
system for predictive processing. This will include selection and configuration of the 
predictive algorithm, the parameters for use with the selected algorithm, and the period and 
sampling frequency. For illustrative purposes, the diagram depicts the simple processing of 
5 the selected algorithm, test 120 - with a negative selection ending processing. Once selected, 
the parameters are entered, block 130, controlling the curve generation. At block 140, the 
actual historical data is formatted in a system readable matrix dbase(I,J); this is concatenated 
with the selected current market data, block 150, and processed in the algorithm, block 160, 
to create the smooth curve. At test 170, the curve is tested for patterns - if positive, logic 
1 0 branches to block 1 80 and the system generates a results file, RST(I,J) providing the 
predictive association. Logic continues for the next sequence, via block 190. 

Turning now to Figure 3, as results are generated, these are used to compare with 
actual price movements and assess the predictive character of the patterns, and to modulate 
the algorithm to enhance predictive performance. Logic begins at block 200 and continues 
15 to the sequence of data loads, block 2 1 0, beginning with the results file. At block 220, the 

actual price movement results are loaded and coupled with the underlying historical price 
data, at block 230. The system runs a correlation for each such entry point in the data set 
and then assesses the confidence levels obtained from the predictions, block 240. 

A convergence criteria is pulled and then used to measure the character of the 
20 predictions. If the criteria is not satisfied, the algorithm is pulled, block 260, and the 
parameters adjusted, via test 270 and blocks 280-290. 

The operative characteristics of the present invention are better understood in the 
context of a specific example, provided below: 

Example 

25 The system and method begins with the premise that prices of a selected asset evolve 

in a non-linear fashion, but contain certain patterns within the noise. As provided 
in the above-noted (and incorporated) paper, quantitative analysis begins with the following 
expression: 

P,=m(JQ+e„ t = l,...,T, (1) 
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where m(X^) is an arbitrary fixed but unknown non-linear function of a state variable X t , and 
{e,} is white noise. The smoothing function m{\) is constructed by setting the state variable 
equal to time, X t - t. Exemplary smoothing functions include kernal regression, orthogonal 
series expansion, projection pursuit, nearest neighbor estimators, average derivative 

5 estimators, splines, and neural networks. To enhance the smoothing function, the system 
uses weighted averages of the P,s where the weights decline asX,s get farther away fromx 0 , 
providing a "local averaging" procedure in estimating m{x). The selected weighting must be 
sufficient to create a smoothed curve, but not so influential as to hide the pattern creating 
non-linearities in the data. 

! o More formally, for any arbitrary x, a smoothing estimator of m(x) may be expressed 



30 



as 

T 



m(x) = 1 I o) t (x)P t (2) 

20 where the weights { w,(jc)} are large for those P,s paired with X t s near x, and small for those 
P,s withes far from x. To implement such a procedure, we must define what we mean by 
"near" and "far." If we choose too large a neighborhood around x to compute the average, 
the weighted average will be too smooth and will not exhibit the genuine nonlinearities of 
m(:). ^ we choose t0 ° sma11 a neighborhood around x, the weighted average will be too 

25 variable, reflecting noise as well as the variations in m(-). Therefore, the weights { co t (x)} 
must be chosen carefully to balance these two considerations. 

The system here applies a kernal regression, with a weight function (o t (x), constructed 
from a probability density function K(x) 



K(x)zQ, jK(u)du = l. ( 3 ) 
By rescaling the kernel with respect to a parameter h > 0, we can change its spread, 



i.e., let: 
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K h {u) = 1 K(u/h), JK h {u)du = 1 
h 



(4) 



5 



30 



35 



and define the weight function to be used in the weighted average (2) as 

<»i,h(x) = K h (x - JQ/g A (x), (5) 



T 

I Kfc-JQ. (6) 
10 T m 

If h is very small, the averaging will be done with respect to a rather small neighborhood 
around each of theX,s. If h is very large, the averaging will be over larger neighborhoods of 
the X t s. Therefore, controlling the degree of averaging amounts to adjusting the smoothing 
1 5 paramerer h 9 also known as the bandwidth. Choosing the appropriate bandwidth is an 
important aspect of any local-averaging technique. 

Substituting (6) into (2) yields the Nadaraya-Watson kernel estimator m h (x) of m(x): 

T 

ZK h (x-X)Y t 

20 m h (x) = 1 Z <<>d x ) r t = -£! ( ? ) 

T t=l T 

25 Under certain regularity conditions on the shape of the kernel K and the magnitudes and 

behavior of the weights as the sample size grows, it may be shown that m h (x) converges to 
m(x) asymptotically in several ways (see Hardle (1990) for further details). This 
convergence property holds for a wide class of kernels. For illustrative purpose, we shall use 
the most popular choice of kernel, the Gaussian kernel: 



K„(x) = — ~ (8) 





hi 2n 
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Selecting the appropriate bandwidth h is clearly central to the success of m h (-) in 
approximating m(-)-too little averaging yields a function that is too choppy, and too much 
averaging yields a function that is too smooth. To illustrate these two extremes, Figure II 
(see "Foundations of Technical Analysis: Computational Algorithms, Statistical Inference, 
and Empirical Implementation" by Andrew W. Lo, Harry Mamaysky, and Jiang Wang, 
published in The Journal of Finance, Vol. LV, No. 4, August 2000, pp. 1705-1765 - the 
contents of which are incorporated by reference as if restated in full) displays the Nadaraya- 
Watson kernel estimator applied to 500 data points generated from the relation: 



where X t is evenly spaced in the interval [0, 2/7] . Panel 11(a) plots the raw data and the 
function to be approximated. 

Kernel estimators for three different bandwidths are plotted as solid lines in Panels 
H(b)-(c). The bandwidth in 11(b) is clearly too small; the function is too variable, fitting the 
"noise" 0.5 eZ t as well as the "signal" Sin(-). Increasing the bandwidth slightly yields a much 
more accurate approximation to Sin(-) as Panel 11(c) illustrates. However, Panel 11(d) shows 
that if the bandwidth is increased beyond some point, there is too much averaging and 
information is lost. 

There are several methods for automating the choice of bandwidth h in (7), but the 
most popular is the cross-validation method in which h is chosen to minimize the cross- 
validation function: 



y, = Sin(Z,) + 0.5eZ, 



eZ, ~ #(0,1), 



(9) 



T 



CV(/i) = l LiPt-mJ 2 , 



(10) 



T t=i 



where 



T 



™H = 1 E o) T , h y T . 

T Trt 



(11) 
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The estimator m h t is the kernel regression estimator applied to the price history {P T } with the 
t-th observation omitted, and the summands in (10) are the squared errors of the m h t s, each 
evaluated at the omitted observation. For a given bandwidth parameter h, the cross- 
validation function is a measure of the ability of the kernel regression estimator to fit each 
5 observation P t when that observation is not used to construct the kernel estimator. By 

selecting the bandwidth that minimizes this function, we obtain a kernel estimator that 
satisfies certain optimality properties, e.g., minimum asymptotic mean-squared error. 

Once the data is converted into a non-linear, smoothed function, this function is 
tested against the stored library of pattern defining highs and lows with the range. 

1 0 We focus on five pairs of technical patterns that are among the most popular patterns 

of traditional technical analysis (see, for example, Edwards and Magee (1966, Chapters VII- 
X)): head-and-shoulders (HS) and inverse head-and-shoulders (IHS), broadening tops 
(BTOP) and bottoms (BBOT), triangle tops (TTOP) and bottoms (TBOT), rectangle tops 
(RTOP) and bottoms (RBOT), and double tops (DTOP) and bottoms (DBOT). There are 

1 5 many other technical indicators that may be easier to detect algorithmically-moving 

averages, support and resistance levels, and oscillators, for example-but because we wish to 
illustrate the power of smoothing techniques in automating technical analysis, we focus on 
precisely those patterns that are most difficult to quantify analytically. 

Consider the systematic component m(-) of a price history {P^and suppose we have 

20 identified n local extrema, i.e. , the local maxima and minima, of m(-). Denote by E u E 29 

the n extrema and t* l9 1* 2 , ..../„ the dates on which these extrema occur. Then we have 
the following definitions: 

Definition 1 (Head-and Shoulders). Head-and-shoulders (HS) and inverted head-and- 
25 shoulders (IHS) patterns are characterized by a sequence of five consecutive local extrema 

E U ... 9 E S such that: 
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{ E { is a maximum 
{ 

{ E 3 >E 1 ,E 3i >E 5 
HS^ { 

{ E x and E 5 are within 1 .5 percent of their average 

{ 

{ E 2 and E A are within 1 .5 percent of their average, 



{ E x is a minimum 
{ 

{ E 3 <E x ,E 3i <E 5 
IHS = { 

{ E x and E 5 are within 1 .5 percent of their average 
{ 

{ E 2 and E 4 are within 1 .5 percent of their average, 

Observe that only five consecutive extrema are required to identify a head-and-shoulders 
pattern. This follows from the formalization of the geometry of a head-and-shoulders 
pattern: three peaks, with the middle peak higher than the other two. Because consecutive 
extrema must alternate between maxima and minima for smooth functions, the three-peaks 
pattern corresponds to a sequence of five local extrema: maximum, minimum, highest 
maximum, minimum, and maximum. The inverse head-and-shoulders is simply the mirror 
image of the head-and-shoulders, with the initial local extrema a minimum. 

Because broadening, rectangle, and triangle patterns can begin on either a local 
maximum or minimum, we allow for both of these possibilities in our definitions by 
distinguishing between broadening tops and bottoms. 

Definition 2 (Broadening). Broadening tops (BTOP) and bottoms (BBOT) are 
characterized by a sequence of five consecutive local extrema E u E 5 such that: 
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{E x is a maximum 

{ 



{E x is a minimum 

{ 



BTOP = {E X <E 2 <E 5 
{ 

{E 2 >E 4 



BBOT = {E X >E % >E 5 
{ 

{E 2 <E 4 



Definitions for triangle and rectangle patterns follow naturally. 

10 Definition 3 (Triangle). Triangle tops (TTOP) and bottoms (TBOT) are characterized by a 
sequence of five consecutive local extrema E l9 E 5 such that: 



15 



{E x is a maximum 

{ 



{E l is a minimum 

{ 



TTOP = {E^E^Es 
{ 

{E 2 <E 4 



TBOT = {E,<E 3 <E 5 
{ 

{E 2 >E 4 



20 



Definition 4 (Rectangle). Rectangle tops (RTOP) and bottoms (RBOT) are characterized 
by a sequence of five consecutive local extrema E x , E 5 such that: 



25 



30 



35 



{Ej is a maximum 
{ 

RTOP = {tops are within 0.75 percent of their average 
{ 

{bottoms are within 0.75 percent of their average 

{ 

{lowest top > highest bottom, 
{E x is a minimum 

{ 

RBOT = {tops are within 0.75 percent of their average 

{ 

{bottoms are within 0.75 percent of their average 

{ 

{lowest top > highest bottom, 
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The definition for double tops and bottoms is slightly more involved. Consider first the 
double top. Starting at a local maximum E u we locate the highest local maximum £ a 
occurring after E } in the set of all local extrema in the sample. We require that the two tops, 
£j and £ a , be within 1.5 percent of their average. Finally, following Edwards and Magee 
5 (1966), we require that the two tops occur at least a month, or 22 trading days, apart. 
Therefore, we have the following definition. 



10 



15 



Definition 5 (Double Top and Bottom). Double tops (DTOP) and bottoms (DBOT) are 
characterized by an initial local extremum E x and a subsequent local extrema E & and E h such 
that 

£ a =sup{i* : t\ > t* l9 k=2,....,n} 

k 

E h ^mf{P* t : t\ > ** b * = 2, 

k 

and 



20 



25 



{E x is a maximum 

{ 

DTOP 3 {E l and E a are within 1 .5 percent of their average 

{ 

{t\- f*!>22 



{E A is a minimum 

{ 

DBOT = {E x and E b are within 1 .5 percent of their average 
30 { 

{t\- t\>22 

Our algorithm begins with a sample of prices {P„...,P t } for which we fit kernel 
regressions, one for each subsample or window from t to t + / +d~ 1 , where t varies from 1 
35 to T- 1 ~d+ 1 , and / and d are fixed parameters whose purpose is explained below. In the 
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empirical analysis of Section III, we set /=35 and d=3, hence each window consists of 38 
trading days. 

The motivation for fitting kernel regressions to rolling windows of data is to narrow 
our focus to patterns that are completed within the span of the window- / +d trading days in 

5 our case. If we fit a single kernel regression to the entire dataset, many patterns of various 

durations may emerge, and without imposing some additional structure on the nature of the 
patterns, it is virtually impossible to distinguish signal from noise in this case. Therefore, 
our algorithm fixes the length of the window at / +d, but kernel regressions are estimated on 
a rolling basis and we search for patterns in each window. 

1 0 Of course, for any fixed window, we can only find patterns that are completed within 

/ +d trading days. Without further structure on the systematic component of prices m(-), this 
is a restriction that any empirical analysis must contend with. We choose a shorter window 
length of / =35 trading days to focus on short-horizon patterns that may be more relevant for 
active equity traders, and leave the analysis of longer-horizon patterns to future research. 

1 5 The parameter d controls for the fact that in practice we do not observe a realization 

of a given pattern as soon as it has completed. Instead, we assume that there may be a lag 
between the pattern completion and the time of pattern detection. To account for this lag, 
we require that the final extremum that completes a pattern occurs on day t + 1 ~ 1 ; hence d is 
the number of days following the completion of a pattern that must pass before the pattern is 

20 detected. This will become more important in Section III when we compute conditional 
returns, conditioned on the realization of each pattern. In particular, we compute post- 
pattern returns starting from the end of trading day t+l+d, i.e., one day after the pattern has 
completed. For example, if we determine that a head-and-shoulder pattern has completed on 
day t + J _ j (having use d prices from time t through time t + / +d~ 1), we compute the 

25 conditional one-day gross return as Z t - Y l+}+d+l /Y l+l+d . Hence, we do not use any 
forward information in computing returns conditional on pattern completion. In other 
words, the lag d ensures that we are computing our conditional returns completely out-of- 
sample and without any "look-ahead" bias. 
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Within each window, we estimate a kernel regression using the prices in that 
window, hence: 



t+l+d-1 

Z K*(7-*)P, (12) 



s=t 



10 t+l+d-l , t=l,..., T-l-d + l, 

s=t 

15 

where K h (z) is given in (8) and h is the bandwidth parameter (see Section C). It is clear that 
m h { r) is a differentiable function of r. 

20 Once the function m h ( r) has been computed, its local extrema can be readily 

identified by finding times r such that Sgn(/w' h {r)) = -Sgn(/w' A (T + l)) 9 where m' k denotes 
the derivative of m h with respect to rand Sgn(-) is the signum function. If the signs of 
m' k ( r) and m\{ r+ 1) are +1 and - 1 , respectively, then we have found a local maximum, 
and if they are - 1 and +1 , respectively, then we have found a local minimum. Once such a 

25 time r has been identified, we proceed to identify a maximum or minimum in the original 
price series {P t } in the range [t~l, * + 1], and the extrema in the original price series are 
used to determine whether or not a pattern has occurred according to the above definitions. 

If m' h {f) = 0 for a given r, which occurs if closing prices stay the same for several 
consecutive days, we need to check whether the price we have found is a local minimum or 

30 maximum. We look for the date s such that s = inf { s > r : m f h (s) * 0} . We then apply the 
same method as discussed above, except here we compare Sgn(m f h (i — 1) and Sgn(m' h (s)). 

One useful consequence of this algorithm is that the series of extrema which it 
identifies contains alternating minima and maxima. That is, if the ^ extremumisa 
maximum, then it is always the case that the (fc + l) th extremum is a minimum, and vice 

35 versa. 

An important advantage of using this kernel regression approach to identify patterns 
is the fact that it ignores extrema that are "too local" For example, a simpler alternative is 
to identify local extrema from the raw price data directly, i.e., identify a price P t as a local 
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maximum if P t _ 7 < P t and P t > P t +J , and vice versa for a local minimum. The problem 
with this approach is that it identifies too many extrema, and also yields patterns that are not 
visually consistent with the kind of patterns that technical analysts find compelling. 

Once we have identified all of the local extrema in the window [f, t + / 1], we 
5 can proceed to check for the presence of the various technical patterns using the definitions 
provided above. This procedure is then repeated for the next window + +d\ 9 and 
continues until the end of the sample is reached at the window [T-l~d+\ 9 T\. 

Although the invention has been described in detail for the purpose of illustration, it 
is to be understood that such detail is solely for that purpose and that variations can be made 
1 0 therein by those skilled in the art without departing from the spirit and scope of the 

invention. 
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